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Abstract. We study the adversarial satisfiability problem, where the adversary can choose whether variables 
are negated in clauses or not in order to make the resulting formula unsatisfiable. This is one case of a general 
class of adversarial optimization problems that often arise in practice and are algorithmically much harder than 
the standard optimization problems. We use the cavity method to compute large deviations of the entropy in the 
random satisfiability problem with respect to the negation-configurations. We conclude that in the thermodynamic 
limit the best strategy the adversary can adopt is extremely close to simply balancing the number of times every 
variable is and is not negated. We also conduct a numerical study of the problem, and find that there are very 
strong pre-asymptotic effects that are due to the fact that for small sizes exponential and factorial growth is hardly 
distinguishable. 



PACS numbers: 64.70.qd,75.50.Lk,89.70.Eg 
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1. Introduction 

The following setting often arises in practical optimization problems. Consider two players, each of them has a given 
set of moves (configurations) and a cost function depending on the moves of both the players. The first player is 
trying to optimize a certain cost function over his set of moves (configurations), and the interest of the second player 
is to make this optimum the worse possible. Consider that in the game at first the second player (adversary) chooses 
his moves (a configuration), and then the first player chooses his moves. What is the best strategy (algorithm) for 
the adversary in the case his set of moves is too large to be able to evaluate all the possibilities? A specific example 
of this adversarial optimization setting could be a police department trying to set border controls in such a way 
that the amount of goods smugglers can transfer is the smallest possible [1, 2]. 

Let us call the set of moves of the adversary u, and the set of moves of the first player v, the cost function 
being f{u^v). The goal of the adversary is to find Um that maximizes min^ f{u.v). Since in common situations 
both u and v have exponentially many components in the size of the system the adversarial optimization is much 
harder than usual (one-player) optimization, because even evaluating the min^/(u, tT) for a given u is typically an 
NP-hard problem. In the theory of algorithmic complexity the so-called NP problems are such for which it is easy 
(polynomial) to evaluate if a proposed solution is indeed a solution. In other words verifying a solution to an NP 
problem is a polynomial problem. Verifying a solution for a general adversarial optimization problem (as described 
above) is itself an NP problem. Without doubts theoretical understanding of hard optimization problems is crucial 
for many areas of science, and the same holds for the adversarial optimization. 

In optimization, the most famous benchmark of a hard problem is the /^-satisfiability (i^-SAT) of Boolean 
formulas. Call a clause a logical disjunction (operation "or") of K variables or their negations. Given a set of N 
Boolean variables Xi, and a set of M clauses, the satisfiability problem consists in deciding whether all clauses can 
be simultaneously satisfied. The /C-SAT problem was the first problem shown to be NP-complete, that is as hard 
as any other NP problem [3]. It has a large number of applications in automated verification and design. Random 
if -SAT problem, where variables in clauses are chosen randomly and negated with probability 1/2, provides easy 
to generate hard formulas [4]. 

Random /f-SAT thus became a common playground for new algorithms, and theoretical ideas for understanding 
the origin of algorithmic hardness. Statistical physics approach related to physics of diluted spin glasses contributed 
tremendously to the understanding of the properties of random ii'-SAT formulas, see e.g. [5, 6]. Following this path 
in this paper we introduce a random adversarial satisfiability problem and develop a statistical mechanics framework 
to understand its properties. This framework can be readily applied to other random adversarial optimization 
problems. We study the large deviation functions for the original optimization problem with respect to the moves 
of the adversary. The main ideas of our approach to the study of large deviations come from studies of spin glasses 
and the cavity method [7, 8, 9, 10]. Our approach is also closely linked to the well established fact that the replicated 
free energy in Parisi's replica symmetry breaking (RSB) [11, 12] can be interpreted as Legendre transformation of 
the large deviation function. We will, however, derive our method independently of these notions, only using the 
factor graph representation of the problem and belief propagation (BP) algorithm. 

One natural setting for random adversarial satisfiability problem is to introduce the negation- variables Jja, 
where Jia = 1 if variable i is negated in clause a, and Jia = if not. The set of moves for the adversary are then 
all the possible configurations of negations {Jia}, the moves of the second player are all possible configurations of 
the variables {xi}. The graph of interactions is chosen at random as before. The goal of the adversary is to set 
the negations in such a way that the resulting formula is the most frustrated possible. In particular we will be 
interested in the question: can the adversary make the formula unsatisfiable or not? We will call this problem 
random AdSAT. 

An independent interest in the random AdSAT comes from the study of random quantum satisfiability problem 
[13, 14, 15]. It was shown that if the adversary can make the formula unsatisfiable then also the quantum SAT is 
unsatisfiable. A natural question is then whether the quantum SAT is yet much more restrictive than the AdSAT 
or not? 

Note also that the quantified satisfiability (QSAT) problem is another SAT-based problem that naturally 
belongs to the general adversarial optimization setting. In QSAT problem we introduce two types of variables the 
existential variables Xi and the universal variables jji the QSAT then consists in deciding whether Vy 3x G{x,y) = 
TRUE , where G{x,y) is a satisfiability formula (with negation- variables fixed). Random ensembles of QSAT were 
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introduced and studied [16]. QSAT is arguably more important for industrial applications than the AdSAT that 
we study here. We chose the AdSAT defined as above because it is slightly simpler to treat and also provides 
information relevant to the original random SAT problem. We plan to apply our approach to the random QSAT 
problem in near future. Our study is also related to the work on optimization under uncertainty [17]. 

The present article is structured as follows: In Section 2 we set the problem of adversarial satisfiability, and 
describe our approach in generic statistical physics terms. In Section 3 we remind the standard belief and survey 
propagation (SP) equations for random satisfiability problem. In Section 4 we derive equations for calculation of the 
large deviations with respect to the negation-configurations. In Section 5 we first present and discuss the cavity result 
for random regular adversarial satisfiability, then we do the same for the canonical (Poissonian) random adversarial 
satisfiability. In Section 6 we compare our theoretical result to numerical simulation that include exhaustive search 
of all the solutions. Finally in Section 7 we conclude and discuss perspectives of this work. 

2. AdSAT as a large deviation calculation 

The random iiT-SAT problem is defined as follows, consider TV Boolean variables {xi}^^-^ ... ^, Xi = {0,1}, and 
M = aN clauses tpa- Each clause depends on K random variables from the A'' available ones. If a variable i belongs 
to clause ipa, then we set Jai = 1 if the variable is negated, and Jia = if it is not. K-SAT problem can be 
represented via a so-called factor graph, a bipartite graph between variables (variables nodes) and clauses (function 
nodes). With edges between variables i that belong to clause a. The negation- variables can be seen as attributes 
of the edges. The random JsT-SAT instance corresponds to the case where the J^^s are drawn uniformly at random. 
Probably, the most well known property of the random if-SAT is the existence of a phase transition at a value ac 
such that if a < ac then with high probability (probability going to one as N ^ oo) there exists a configuration 
{xi} that satisfies all the clauses, and for a > ac no such configuration exists with high probability. We define di as 
the ensemble of function nodes connected to the variable node i, da as the ensemble of the variable nodes connected 
to the function node ipa- 

The adversarial satisfiability problem (AdSAT) is defined by drawing a random K-SAT instance as before 
without deciding the negation- variables {Jia} = J . A solution to the AdSAT problem is given by a set J such that 
the resulting instance is unsatisfiable. Just as in random if-SAT there is a threshold in the random AdSAT such 
that for a < ao no solution to the AdSAT formula exist with high probability. And for a > a solution exists 
with high probability. We observe < since above ac a random configuration of negations makes the formula 
unsatisfiable, recall adK = 3) = 4.2667 [18]. Also aa > ap, where ap = 1/K is the percolation threshold below 
which the graph is basically a collection of small trees and few single loop components, which are both satisfiable 
for any configuration of negations. One of the goals of the present paper is to estimate the value of the AdSAT 
threshold aa- 

In random K-SAT the satisfiability threshold can be found by counting the number of configurations that have 
a certain energy E{{xi}) (i.e. number of unsatisfied clauses). To compute the entropy one introduces a Legendre 
parameter /3 and computes the free energy / defined as 

g-/3iV/(/3) = g~|3E{{'^^}) = gJVl6(e)-/3e] ^ M^) ^ ^ ^ 

where the number of configurations having energy E is e^'-^\ If E = belongs to the support of the function S{E) 
then the problem is in the satisfiable phase, if not than the problem is in the unsatisfiable phase. In the satisfiable 
phase call s = S{0)/N the entropy of satisfying configurations. The cavity method and the replica symmetry 
breaking serve to compute /(/?) in the thermodynamic limit iV — >■ cxd [19, 5]. There are two crucial properties that 
make this calculation possible: First, the energy can be written as a sum of local terms 

E{{xi}) = E n ^^^^j-^ (2) 

Second, the underlying factor graph is locally tree-like. These computations moreover provide much more 
information about the problem than the value of the satisfiability threshold. 
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In the study of random AdSAT we will proceed analogously. We consider the number of configurations of the 
negations that yield a given value of the entropy of solutions s 



s{J) = ^loi 



M / 

En i-n^-^=. 

{xi} a=l \ ieda 



(3) 



We define a large deviation function £{s) as the logarithm of this number divided by the size of the system N. 
Again to compute £(s) it is advantageous to introduce its Legendre transform 

Z{x) = e^*(^) = ^ e"^^(^) = e^[^(^)+^^] , = -x . (4) 

We stress here that C{s) is the large deviation function with respect to the negation-configurations, it it hence defined 
for a given geometry of the satisfiability formula. In what follows we assume, as is usual, that £{s) is self-averaging 
with respect to the formula geometry, i.e. C{s) is almost surely the same function for two randomly chosen formulas. 
This assumption is a generalization of the self-averaging property of the free energy in the canonical K-SAT problem. 
Note also that in writing this expression we implicitly assume that the number of negation-configurations that give 
a certain entropy is exponential in N. If it is smaller that exponential in N computation of ^{x) will lead to 
yC(s) = — cxD. We will come back to this point in Sec. 6. 

Remark two special cases: For a; = the partition function (4) is simply equal to the total number of negation- 
configurations $(0) = Ka log 2. For x = 1, the partition function above is related to the annealed partition function, 
$(1) = log 2 + a log (2^ - 1) . 

The major difficulty in calculating $(x), for a general value of x, is that the entropy s{J) is not defined as a 
sum of local terms. On the other hand the geometry of the underlying factor graph is still tree-like in the random 
AdSAT, hence for any configuration of negations J7 we can apply the cavity method (with replica symmetry breaking 
if needed) to compute the entropy s{J). In the cavity method, as it is reminiscent of the Bethe approximation, 
the entropy (or more generally Bethe free energy) can be written as a sum of local terms. This fact enables us to 
calculate <f>(x). 

The statistical physics treatment of the random if-SAT problem among others led to a discovery that replica 
symmetry breaking approach is needed [19, 5, 6] in order to correctly compute the entropy close to the satisfiability 
threshold etc. Said in other words, in that region the space of solutions splits into well crgodically separated clusters. 
We define the complexity function, E, as the logarithm of the total number of clusters per variable. The value of 
the complexity can then be computed with the survey propagation equations [5]. At the satisfiability threshold the 
complexity goes to zero, whereas the entropy density of solutions is a positive number even at the threshold. With 
this in mind it will be useful to define also 



e 



;p(E)-l-a;E] 



where £sp(S) is the entropy density of negation-configurations that give a certain complexity function S. 



3. Reminder of equations for belief and survey propagation 

With the notation introduced in the previous Section we write the belief BP equations and the Bethe entropy as 
derived e.g. in [20, 21]. These equations are asymptotically exact on locally tree-like graphs as long as all correlation 
length-scales are finite. If they are not then splitting the phase space into clusters such that within each cluster the 
correlations decay again might be possible. SP then estimates the total number of such clusters [5], and it does so 
asymptotically exactly at least close enough to the satisfiability threshold [22, 6]. 

Denoting by {mia,niai} the BP (SP) messages, we write the BP (SP) fixed point equations as 

mia = giailrribijbediXa, {Jbi}bedi), (6) 
'Thai = 9ai{{'mja}jeda\ij {Jja}jeda)- 
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In the BP case, the messages read [21] rriia = i^lJ, rhai = {i>°j, i>^J, and 



n 



9li{{'^ja}jeda\ii {Jja}jeda) 



Ylbedi\a ^bi + IlbGaiXa ^bi 
2 ~ rijeaaXi ^la 



\~j^da\i ja 

where r = 0, 1. In the SP case, rriia = {Qfa, Qfa^ QL}^ "^q* = Qai and 

5ia({"^6i}be9i> {Q6i}6eai\a) = C' (1 - Qfej), 



1 - [] (1 - Qbi) 

beSic 

1 - n (1 - ^fi) 



gai{{Qja}jeda\i) 



beSia 

= n 

j&da\i 

where C is a normalization constant enforcing the relation g*a+ dfa'^ 9ui — ^' and Sia, Uia are defined as 

if Jia =0 Sia = dni\{a}, Uia = dii 
if Jia = 1 = 9ii\{a}, Uia = doi, 

where 9o/ii = {a € Sisuch that J^a = 0/1}. 

If {mia-,rhai} is a fixed point of Eqs. (6), the Bethe entropy for BP and the complexity for SP are 
written in a general form 

M JV 

a=l i=l {a,i) 

where for BP 



^a{{Via, Jia}ieda) = log I 1 - J| i^^*" j , 

Si{{i>ai,Jia}aedi) = log H ''bt + Yl ^bi ] 
\bedi bedi I 



^aiiyiai ^ai) 



log {vl^vli + vla^lil , 



while for SP 



Sa{{Qia,Jia}ieaa) = log | 1 - [] Q^^ 

jeda 



^i{{Qai,Jia}aedi) = log 
Sai{Qia,Qai) = log (l - Qf^Q 



n (1 - Qbi) + n (1 - <3f^) - n (1 - 's^i) 

.b^doi bS.dii 6e9i 
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Figure 1. The auxiliary fax;tor graph describing Eq. (14). Empty circles and empty Squares represent variable 
and function nodes of the original factor graph of random K-SAT. Empty squares with black-filled squares inside 

represent the function nodes, empty circles with black-filled squares inside represent function nodes of the 
auxiliary graphs. Finally, black-filled squares represent function nodes, while black-filled circles WiQ-variable 
nodes of the auxiliary graph. An edge connecting (jjo to a function node of the auxiliary graph means that such a 
function node depends on ojia- 



4. Computation of the large deviations function 



The most important formula of the previous Section is (10), in certain regimes it gives the asymptotically exact 
entropy or complexity in a form factorized in local terms. The remaining complication is that now everything 
depends on the fixed point of the BP (SP) equations. We can, however, write 



(13) 



J 
X 



Yi ^i'^ia - 9ia{{mbi}bedi\a, {Jbi}bedi)) S{m 

ai 9ai 
{i,a) 



If we now introduce auxiliary variables Uia = {Jia, fnia-, 'rhai} the free energy defined by (4) can be re- written in the 
common local form 



Z{x) = E 

Ma} 



M 



n *a(Ma}ie da) 



.a=l 



N 



]][*i({Wja}aeai) 



.j=l 



n 

(a,i) 



(14) 



where 



n 5{mai - gai{{inja}jeda\i,Uja}jeaa)), 
JJ S{mia - gia{{lTT'bi}bedi\a,{Jbi}bedi)), 



(15) 



-xSaiiinia,rhai) 



In Eq. (14) and in the following, the sum over cj,;„ stands for the sum over and the integral over m,„, Too, . 

The probability measure in Eq. (14) is local and can be hence represented with an auxiliary factor graph that 
can be viewed as decorating the original K-SAT factor graph. Fig. 1 depicts this construction. 

The partition function Z(x) can now be computed by implementing the general BP formalism to the auxiliary 
graph, just as it is done in the derivation of the IRSB equations in [21] (note indeed the close formal resemblance 
of our approach and the IRSB equations). We call Sia{ujia) the message going from the variable node ia to the 
function node a, and Sai{ijJia) the message going from the variable node ia to the function node i. BP equations 
on the auxiliary factor graph on the variables LUia then lead to fixed-point equations for these messages 

Sai{<JJia) ^ ^ [Zai{{mja,Jja}]eaa,mai)r Yi ^^^'^3 ^ 9aj{{mka}keda\j, {Jkajkeda)) X 

X n ^i-i^oa), (16) 
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Siai'^ia) - X! [ziaiirhbi, Jib}bedi,rnia)Y I{mib = gib{{rhci}cedi\bAJci)cedi)) 

{'^ib}bf^di\cL bedi 
bedi\a 



where 



liirhbi, Jib}bGdi, 
^\\"''Ja: '^ja JjGaai '"^atj — ^ 



Eq. (16) can be further simphfied. It is easy to check that when the fixed point Eqs. (6) hold, the term Zia 
(resp. Zai) does not depend on the 'backward' messages rriia (resp. rhai)- Using this result, we can see that (16) 
are compatible with a choice of the messages 5^^, Sai depending only on rriia, Jia and only on rhai, Jia respectively. 
Indeed, if we assume that Sia{uJia) = S,a{mia, Jai), and Sai{uJia) = Sai{rhai, Jai) then (16) becomes 

Siaimia,Jai) ~ ^ j ^ dmbiSbi{mbi, Jbi) [Ziai{rhbi, Jib}bedi\a)T >< 

{Jbi}b€di\a bedi\a 

X S{mia - gia{{mbi}bedi\a, {Jbijbedi)) 

Saiij^ai, Jai) — ^ ] j H drUjaSjairnja, Jja) \Zai{{rnja, J ja} jeda\i)\ ^ 
{Jaj}j(iaa\i jeda\i 

X ^(jTlai - 9ai{{'mja}je^a\^,{Jja}je^a))■ 
The free energy $(a;) can then be computed using the general expression for the Bethe free-entropy [21] 

M N 

N^x)=J2^a + J2^^-J2^ia, (19) 

(i,a) 



(17) 



(18) 



a=l 



where 



F„ = log 



j2 J n ^"^-^»("^»"^-)^"'°"'"""'"^'''"' 



F- 



log 



log 



{Jia}i<, 

E 



dihaiSaiijhai, Jia)S- 



xSi{{mai}a€ai) 



aGui 



^ / dmiadrhaiSia (P^ia, Jia)Sai{lTla 

iJia 



J V' 



(20) 



(21) 



(22) 



Equations (17-18) clearly show the formal analogy with IRSB cavity equations [21]. The difference between 
Eqs. (17-18) and the latter is that in the AdSAT case the negations are considered as physical degrees of freedom of 
the partition function Z{x), and the resulting cavity equations (17-18) consist in a weighted average of the quantities 
[Zia{{mM, Jib}bedi\a)T , [zai({mja,Jja}jeda\dT over {Jia, ma, rhai}- On the contrary, in the IRSB case the only 
degrees of freedom are the BP messages {rriia, Aai} in such a way that the resulting IRSB cavity equations consist 
in an average over rriia, "rhai at fixed JjoS. 

The biggest advantage of the formal resemblance of Eqs. (17-18) to the IRSB cavity equations is that in 
order to solve (17-18) numerically we can use the very same technique and all the related knowledge as in the 
case of IRSB. We indeed use the population dynamics [19], where the distributions Sia{mia, Jia), Sai{rhai, Jia) are 
represented as populations of P messages. When the size of the population P is large, we expect the populations 
to reproduce well the distributions Siairriia, Jia), and Sai {rhai, Jia)- The cavity equations (17-18) can be written 
in terms of these populations. Starting from a given initial configuration, the iteration of the cavity equations 
yields the fixed-point populations satisfying (17-18). Once this fixed point is achieved the free-energy ^{x) can 
be computed numerically by means of (19-22). This is repeated for different values of x and finally the Legendre 
transform £{s) is evaluated. Everything is done in the very same way the IRSB equations are usually solved, for 
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more details see e.g. [19, 23, 24]. The only diflFerence is in the treatment of the negation- variables. In our case one 
writes the distributions Sia{mia, Jia), Sat{rhai, Jia) as 

^iaij^iai Jia) — '^Sia{rnia\Jia) ^ Saiirflai^ Jia) — '^S(ii{lJlai\Jia)i 

because Sia{Jia) = J dmiaSia{m,ia, Jia) = 1/2 = SaiiJia) as can be seen by expHcitly integrating Eqs. (17- 
18). We then introduce a pair of populations {Sf^is], S;l^[s]}s=l,■■■,p representing the probability distributions 
Sia{'mia\J), Sai{rhai\J) respectively. The population dynamics is then implemented in terms of such populations, 
and the resulting fixed point investigated numerically. 

5. Cavity method results for random AdSAT 

In this Section we present the solution of the cavity equations (17-18), and its implications for the random AdSAT 
problem. 

5.1. Large deviations of the entropy and complexity on regular instances 

Before addressing the random AdSAT as defined in Sec. 1 wc will study it on random regular instances. On L-rcgiilar 
instances every variable belongs to exactly L clauses. A random L-rcgular instance is chosen uniformly at random 
from all possible ones with given number of variables N and number of clauses M, provided that KM = LN. 

In the 3-SAT problem discussed in this paper the Bethe entropy is asymptotically exact only as long as the BP 
equations converge to a fixed point [6] , the non-convergence is equivalent to the spin glass instability, or a continuous 
transition to replica symmetry breaking phase. On random regular graphs with random values of negations, BP 
stops converging at L = 12 (this is the reason why these and larger value are omitted from Table 1), meaning that 
for L > 12 there is a need for SP (or other form of replica symmetry breaking solution). For L > 15 BP iterations 
for random regular 3-SAT lead to contradictions (zero normalizations) meaning that in this region the large random 
instances are almost surely unsatisfiable. Survey propagation on random regular 3-SAT has a trivial fixed point for 
L < 12, a fixed point with for L = I'd with the value of complexity S(L — 13) = 0.008. SP does not converge for 
L > 14, if we ignore non-convergence, and compute the complexity from the current values of messages we get on 
average T,{L = 14) = —0.03. This means that L = 13 is the largest satisfiable case. 

The great advantage of random regular instances is that topologically the local neighborhood looks the same for 
every variable i. Moreover, we remind that in regimes where the BP equations are asymptotically exact properties 
of variable i depend only on the structure of the local neighborhood of i. Hence on regular graphs all the quantities 
in Eqs. (17-18) are independent of the indices and a, b. This so-called factorization property simplifies crucial 
numerical solution of Eqs. (17-18), the 2KM distributions {Sia{i'ia, Jia), Saii^ai, Jai)} reduce to only 2 distributions 
S{y, J), S{y, J). Moreover, the thermodynamic limit is taken directly without increasing the computational effort. 
(To avoid confusion, we remind here that for the canonical random K-SAT problem where negation-variables are 
chosen uniformly at random and fixed, the BP solution is not factorized. In the adversarial version one sums over 
the negation-variables, hence the factorization.) 

In Fig. 2 we show the large deviation function £(s) of the Bethe entropy s obtained by the population dynamics 
over BP messages on regular graphs with K = i and variable degree L = 4, i.e. by solving Eqs. (17-18) and (19- 
22). First of all, in the "infinite temperature" case, i.e. when the Legendre parameter a; = 0, that is at the 
maximum of £(s), we recover the logarithm of the total number of negation-configurations C{x = 0) = Llog2. 
The corresponding value of entropy s{x = 0) = Sran is the Bethe entropy for a random choice of negations (values 
summarized in Table 1). 

The inset of the figure shows that as the Legendre parameter x — )■ ±oo both C and s converge to well defined 
ending points (the same data in a logarithmic plot show that the convergence is exponential). Let us denote the 
lowest entropy ending point (left, x — >■ — oo) {sl,jCl), and the highest entropy ending point (right, x — >■ oo) {sr, £r). 
We observe systematically that the value of sr — su, where su is the entropy of the uniform negation-configuration 
which is obtained by computing a fixed point of Eqs. (7-8) such that — v V6, i, Vja ~ v "ij, a and Jia = Vi, a, 
and plugging it into Eq. (10), values summarized in Table 1. An edge independent fixed point of the BP equations is 
called factorized. We realize that su also corresponds to the value of the Bethe entropy when every variable is either 
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Figure 2. Left; The large deviation function C{s) vs. Bethe entropy s computed by population dynamics on 
regular graphs with X = 3, L = 4, population size P = 10*. The left ending point (/i, Cl) corresponds to balanced 
configurations of negations, whereas the right ending point {fR,Cji) to the polarized configurations of negations 
(more in the text). Right: The large deviation function £sp(E) vs. the complexity S computed by population 
dynamics on regular graphs with K = 3, L = 13, P = 10* (bottom), and K = 3, L = 12, P = 7.5 ■ 10* (top). 



L 




SB 


SR 


2 


0.6039 


0.5710 


0.6196 


3 


0.5592 


0.5324 


0.5975 


4 


0.5134 


0.4488 


0.5796 


5 


0.4686 


0.4120 


0.5644 


6 


0.4220 


0.3266 


0.5513 


7 


0.3750 


0.2902 


0.5397 


8 


0.3302 


0.2044 


0.5293 


9 


0.2816 


0.1677 


0.5199 


10 


0.2319 


0.082 * 


0.5114 


11 


0.1813 


0.042 * 


0.5035 


12 


0.128 * 


X 


0.4962 


13 


0.07 * 


X 


0.4894 


14 


X 


X 


0.4831 



Table 1. Bethe free-entropy on regular instances with random negation-configuration (sran), with balanced negation- 
configurations (sb), and non-frustrated with Jia = OVj,a (sn). The entropy for the non-frustrated case and for the 
balanced case for even degree L < 10 can be computed analytically since the BP fixed is factorized in these cases. In 
the other cases we iterate BP equations on large random graphs and compute the entropy from the corresponding 
BP fixed point. The star signals that BP did not converge and the value of entropy was obtained by averaging over 
an interval of time. The X means that BP converged to contradictions for these densities of constraints. 

always negated or never negated, we call such negation-configurations polarized. There are 2^^ polarized negation- 
configurations, and indeed the logarithm of the number of such choices corresponds to the value of Cfi = log 2. 
Intuitively such configurations of negation are frustrating the formula in the least possible way. And Fig. 2 shows 
that such intuition is asymptotically exact in this case. 

Similarly for the lowest entropy ending point, for even values of the degree L, we realize that Cl — log (^^2) 
and sl corresponds to a value sb that is obtained from a factorized BP fixed point when each variable is L/2 times 
negated and L/2 times non- negated, values are summarized in Table 1. Such balanced configurations of negations 
locally frustrate the variables in a maximal way (half clauses want the variable to be 1, the other half 0). And the 
computation presented in Fig. 2 suggests that asymptotically there are no correlated negation configurations that 
would frustrate the formula even more and decrease the value of the entropy further. 

We investigate more in detail the result following from Fig. 2. i.e. that the most frustrated configurations of 
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negations on the regular graphs with even degree are the balanced negations, we denote the balanced negation- 
configurations {Jiajs = J^B- There is (^^2) such negation-configurations. Does our result mean that all of 
them lead to the same number of solutions A/'(J7b)? We will see in Sec. 6 that this is not true for finite A''. The 
correct conclusion of the result presented in Fig. 2 is that MmN^aoi^og N'{J'b)]/N ~ sb ~ sl independently of 
the realization of Jb ■ This can also be seen directly from the solution of the BP equations on the formulas with 
balanced negations. Indeed, the fixed point of the BP equations is factorized and independent of the realization 
of negations and also of the size of the graph. Wc tried numerically formulas of various sizes and many possible 
realizations of balanced negations and for even degree L < 10 BP always converges to the factorized fixed point (at 
L = 10 BP stops converging, as we will discuss later in the paper) giving always the same Bethe entropy density 
Sb- Further discussion about the true entropy fluctuations compared to the constant Bethe entropy in this case 
will be presented in Sec. 6 

For regular graphs with odd degree L we cannot achieve ideal balancing of every variable. Instead, we call 
a configuration of negations balanced if for every variables there is either (L — l)/2or (L-|-l)/2 negations. The 

total number of such configurations is then 2^((-^_^j^^y2) • fixed point on the balanced instances for odd L is 
not factorized anymore. We can, however, solve the cavity equations (17-18) restricted to only balanced values of 
negations and we obtain that within the error-bars of the numerical resolution of the equations (that are less than 
1%) all the balanced configurations give the same value of Bethe entropy also in the odd L case. 

Our results for large deviations of the Bethe entropy lead to a conclusion that for the regular instances and 
in the limit N ^ oc the most frustrated formulas are all those with balanced configurations of negations. Let us 
hence conclude this Section by summarizing the properties of regular SAT instances with balanced negations. BP 
on balanced instances converges for L < 9, and leads to contradictions for L > 13. Survey propagation on balanced 
regular instances has a trivial fixed point for L < 9, for L = 10 a fixed point with complexity T,b{L = 10) = 0.018, 
for L > 11 the complexity is negative (e.g. 'EbIL = 11) = -0.001, ^^(L = 12) = -0.075). 

For completeness, we also computed the large deviations of the complexity function. That is, we solved Eqs. (17- 
18) using SP as the basic message passing scheme. Fig. 2 right shows some of the results for L = 12 and L = 13, 
we indeed see that there arc configurations of negations that lead to negative complexity. Unfortunately, it is hard 
to extract any information from these curves for very negative values of x, because of the noise introduced by the 
finite population-size effects. This also poses a problem for L = 10 and 11 where we know that a non-trivial fixed 
point of SP exists for the balanced configurations of negations. In the population dynamics we should hence see a 
non-trivial solution for very negative values of x. Instead wc were only able to obtain very noisy and inconclusive 
data from the population dynamics with population sizes up to 7.5 • lO**. For L = 10 the SP equations have only one 
factorized fixed point for all the balanced configurations of negations, this again strongly suggests that instances 
with balanced negations are the most frustrated ones, and hence that for L — 10 the adversary cannot make large 
formulas unsatisfiable. For lower values of L < 9 the population dynamics has always only a trivial fixed point 
given by S,a{Q,a- -ha) = ^{Qfa)S(QZ)S(Ql - l)/2. SaAQa^, -ha) = HQa,)/'^ yielding $(,t) = Ka\og2. SP is hence 
not very useful in this case to obtain new information about the random AdSAT problem. 

In summary, for L > 11 the adversary will succeed to make a large formula unsatisfiable by simply balancing 
the negations (for L > 14 a random choice of negation would do). On the other hand, following our previous 
conclusion that the balanced formulas are the most frustrated ones, for L < 10 the adversary will not be able to 
make large random regular SAT instances unsatisfiable by adjusting the values of negation-variables. 

5.2. Results for random AdSAT, i.e. instances with Poisson degree distribution 

In the most commonly considered ensemble of the random if-satisfiability problem the triple of variables appearing 
in each clause is chosen independently at random (avoiding repetitions). For large system sizes this procedure 
generates Poissonian degree distribution with mean a. In this case every node has a different local neighborhood 
and hence the fixed point of Eqs. (17-18) is not factorized, instead the distribution Sia{mia, Jai) is different on 
every edge. We hence solve Eqs. (17-18) by generating an instance of the problem (graph) of size N, associating one 
population of size P with every directed edge and iterating following Eqs. (17-18). This is more computationally 
involved and we are able to treat only modestly large and P, typically several hundreds. The resulting large 
deviation function C{s) is depicted in Fig. 3 for several values of constraint density a. 



Adversarial Satisfiability Problem 



11 



For low values of the constraint density, e.g. a = 1 in Fig. 3, the location of the right (large entropy, least 
frustrated) ending point {sn-Cn) corresponds, as in the case of random regular instances, to the value of Bethe 
entropy that is obtained if no negations are present in the instance (Jjo — for all ia), and Cb. = (1 — e^^") log 2 
(corresponding to the number of negation-configurations where no variable is locally frustrated). For larger values 
of the constraint density, e.g. a = 2 in Fig. 3, the results from population sizes as large we were able to achieve are 
very noisy for large values of x w 100. We observed that the data are getting smoother as the population size is 
growing, however, not enough to be able to conclude from these data whether (s/{,>Ci{) is the right ending point. 

The part of the curve corresponding to a very large negative parameter x does not converge to an ending point. 
Instead at some xq the large deviation function C{s) ceases to be concave, and an unphysical branch appears for 
X < xo- This unphysical branch is not present on random regular instances with even degree, when the degree is odd 
the data for large negative x are inconclusive in the sense that we might see a unphysical branch or only a numerical 
noise. We define the left ending point as the extreme of the physical branch sl = s{xq), and jCl = jC{xo). And 
we observe systematically that in the region of interest (say for a > 1) the values sl and jCl are very close to the 
values corresponding to balanced instances {sb,jO-b)- In balanced instances each variable is negated as many times 
as non-negated (for variables of odd degree the absolute value of the difference between the number of negations 
and non-negations is 1). In the thermodynamic limit the number of such balanced negation-configurations is 

i=0 ^ ^ ^ ^ i=0 

We made a number of attempts to obtain a value of entropy considerably smaller than the balanced entropy, 
s < SB- First, wc removed the leaves from the formula and balanced only the residual formula. This indeed leads to 
a lower value of entropy but for a > 1 the difference was less than 1%. We tested the population dynamics limited 
to the balanced negation configurations, i.e. we solved equations (17-22) where the sum over the negation-variables 
in Eqs. (18) and (21) was limited only to the balanced negation configurations. The large deviation function C{s) 
obtained this way did not differ more than by 1% from the value {sb,j0.b), see Fig. 3 left. We also investigated 
the results of population dynamics over the SP equations and we were not able to find cases where the complexity 
would decrease by more than 1% below the complexity value on the balanced instances. We also tried simulated 
annealing on the negation-variables using the Bethe entropy as the cost function, with the same result. All this 
makes us conclude that with at least 1% of precision the satisfiability threshold for random adversarial SAT equals 
the satisfiability threshold of the balanced random ensemble. 

Let us hence summarize results about BP and SP for the random satisfiability problem with balanced 
configurations of negations. For K = 3 the BP ceases to converge for a > 2.96. SP starts to converge to a 
nontrivial fixed point for a > 3.20, and the complexity decreases to zero at 

as = 3.399 ± .001 (24) 

this is hence the satisfiability threshold on the balanced random formulas. All our observations about the large 
deviation function suggest that the threshold for random adversarial satisfiability problem satisfies aa > 3.39. 



2i + l 
i 



(2z + l)!' 



(23) 



6. Numerical results for AdSAT and large deviations 

In this final Section we compare theoretical predictions from the cavity method with numerical results. 



6.1. Numerical results for large deviations 

First, we investigate numerically the number of configurations of negations yielding a formula with a certain entropy 
of solutions. For one given random graph geometry of size N, we generate independently at random 7^1 
different configurations of negations, and for each of them we count the number of solutions using a publicly 
available implementation of exact counting algorithm relsat [25]. We define probability Pn{s) over the negation- 
configurations that the value of the entropy density was between s and s + As, where As is a binning interval that 
we will specify later. 

Following the assumption of exponentially large deviation function made in Eq. (4) , we define 

£jv(s) = ^ logPiv(s) - logmaxPjv(s) • (25) 
iV L s J 
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Figure 3. Left; The large deviation function C(s) vs. entropy s computed by population dynamics on Poissonian 
graphs with random negation configurations, for K = 3, various values of the constraint density a, and both positive 
and negative x. For a = 1, 2, 3 the part of the curve with a; < has been computed with N = P = 300. For a = I 
the part of the curve with a; > reaches the right ending point, and has been computed with N = P = 500. For 
a = 2,3 the part of the curve with positive x has been computed with N = P = 700, and it does not reach the 
right ending point, even larger A'^ and P are needed to remove the noise from the data for very large positive x. For 
X < 0, an unphysical branch (concave part of the curve) starts at xq —42, xq ^ —44, xq ~ —40 for q = 1,2, 3 
respectively. Points indicate the values for balanced configurations of negations (sgjCg), and for configurations of 
negations where all the variables are non-negated (sii,Cii). 

Right: Zoom at the large deviation function C{s) vs s for a = 1 near to the low-entropy ending point for 
N = P = 500. The data become noisy near to the low entropy ending point, larger graph and population sizes lead 
to an improvement. We plotted the data down to the lowest value of entropy s, hence the unphysical branch is not 
plotted. The point indicates the value {sb,Cb) for balanced configurations of negations. The blue data points is 
the large deviation function restricted to balanced configurations of negations for A' = 10^, P = 10^. Notice the 
narrow range of entropies s plotted and how little the lowest entropy we achieved differs from the balanced value. 



The numerical result for £jv(s) is depicted in Fig. 4 left for L = 8, and compared to the prediction of large deviation 
of the Bethe entropy from Section 5. The agreement between the numerical data point and the theoretical prediction 
is not good in the low entropy region. One possibility is that this is due to pre- asymptotic effects, on the other 
hand this does not seem likely as the numerical curves seem to superpose nicely for different system sizes. Another 
possibility is that we neglected some replica symmetry breaking effects, note, however, that the large deviation 
calculation over survey propagation did not provide any non-trivial result. We hence leave this disagreement as an 
open problem. 

At this point we want to recall the result from BP we obtained on balanced regular instances with even degree 
(i.e. for instance L = 8), in that case the BP fixed point was factorized and independent of the negation-configuration 
even for small graphs. Let us hence investigate the numerical results for entropy large deviations in this case. The 
data for Cn{s) are depicted in Fig. 4 right, recall from Table 1 that the maximum of the curve corresponds to sb 
obtained with BP. The curves in Fig. 4 right clearly do not superpose for different system sizes. Instead the Cn{s) 
seems to be 'closing'. From these data it is indeed plausible that in the limit N oo, Cn{s) converges to a delta 
function on the value of entropy s = sb- 

Hence, data in Fig. 4 right suggest that the probability that the entropy of a formula is different from the value 
predicted by BP is smaller than exponentially small. This makes us conclude that in a general case, the probability 
of having an entropy outside of the interval (sl,s_r) is smaller than exponentially small (we remind that for the 
balanced negations and even degree L regular graphs sl — sb. — sb)- Hence in the thermodynamic limit there are 
almost surely no negation-configurations that would lead to a value of entropy outside the range {sl, sb)- 

Moreover, the large deviation function Cn{s) if asymptotically negative can be interpreted as a probability of 
generating a rare graph and configuration of negations having entropy s [26]. Since there are of order regular 
graphs, and there is none or at least one graph with entropy s ^ (sl, Si?,), we can have either Pn{s) = or 

PAr(s) > e~=i^i°«^ (26) 

where Ci is some positive constant. Consider now that there is e^^ of configurations of negations (e.g. C = Ka log 2 
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Figure 4. Left: C]\f{s) vs s computed exactly for 15 < Af < 60 with Ns = 10^ samples ^7, and binning interval 
As = 10"^, and C{s) vs s computed with the population dynamics for both negative x {P = 5 ■ lO**) and positive x 
[P = 2- 10"), with _fC = 3, L = 8. 

Right: Cff{s) vs s computed exactly for 21 < A' < 75, Ns = 10^, binning interval As = 10~^, and K = 3, L = 8. 
The curves don't superpose for / < /bi so the left-large deviations are faster than exponential. 



if we consider all the negations-configurations, or C — Cb if we consider just the balanced negations-configurations). 
The fraction of graphs with configurations of negations leading to entropy s ^ (s^, sr) has to be small only if 

Pjv(s)e^^' « 1. (27) 

If an equality holds in Eq. (26) then Eq. (27) holds in the thermodynamic limit, N ^ oo. However, (27) does not 
have to hold for finite N unless 

N>Nc = exp . (28) 

Since ci can be considerably smaller than C , the crossover value of Nc might be very large and out of reach for 
exact numerical methods. This justifies the presence of strong pre-asymptotic effects for system sizes treated in 
Fig. 4. 



6.2. Strong finite size corrections for AdSAT threshold 

We investigate numerically the AdSat threshold aa by computing the probability (over random graph instances) ps 
that an adversary is not able to find a configuration of negations that makes the formula unsatisfiable. We do this 
in the regular instances because of reduced fluctuations that arise because of the randomness of the graph. 

We generate / ^ 1 regular instances for each value of the degree L and for each size N. Then for each instance 
we use simulated annealing on the negation-variables in order to minimize the number of solutions, we monitor if 
an unsatisfiable formula is generated or not. This general strategy for AdSAT was suggested by [27]. In particular, 
we introduce an inverse temperature /3. Initially we set /? = 1. We choose randomly one of the negation- variables, 
Jia, and attempt to flip it, i.e. to set Jia — ^ 1 — Jia- Denoting by J' the configuration of negations after this fiip, we 
accept the flip with probability max{l, e'^(*-^'~*'^)}. The entropy sj is computed exactly with a publicly available 
implementation of exact exhaustive search algorithm relsat [25]. This algorithm has an exponential running time 
in the size of the system, limiting us to very small system sizes. Attempting for N negation flips is one Monte-Carlo 
(MC) step. Every 10 MC steps we multiply the inverse temperature by a rate factor r > I. We keep track of the so 
far minimal value of entropy Smin and the index no of the MC step in which it was first found. The algorithm stops 
if either an unsatisfiable instance was encountered or no further decrease in the value of entropy Smin has occurred 
in the last 9 • riQ + 50 MC steps. The probability ps plotted in Fig. 5 is then given by the fraction of cases in which 
an unsatisfiable instance was not found. 
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Figure 5. Probability pg that using the simulated annealing algorithm described in the text we did not find 
any unsatisfiable configuration of negations for L-regular instances, as a function of L, for different systems sizes 
9 < < 54. We used the annealing rate r = 1.1, and number of instances / = 100. 

There is of course no guarantee that our algorithm found the actual minimal possible entropy. So, strictly 
speaking, any result for the satisfiability threshold derived from the data for ps is only an upper bound to the true 
threshold. However, given the strictness of our stopping condition we have a reasonable confidence that our results 
are very close to the exact result. Fig. 5 depicts the fraction of regular instances of size N where we were unable to 
find a configuration of negations that would make the formula unsatisfiable. 

On a first sight numerical data in Fig. 5 do not agree with our theoretical predictions. Indeed, we predicted 
that unsatisfiable configurations of negations exist only for L > 11, whereas, on the system sizes our simulated 
annealing algorithm was able to treat, we find unsatisfiable negation-configurations for a large fraction of graphs 
with L > 8. 

On a second sight, however, we see in Fig. 5 that for L = 6 and L = 7 there are very strong finite size corrections 
to Ps- Indeed, for L = 6 and size = 9 we find that roughly 3/4 of the instances can be made unsatisfiable, whereas 
for = 36 none of the / — 100 instances that we tried can be made unsatisfiable. Similarly for L = 7 and size 
= 36 we find that most of the / = 100 instances can be made unsatisfiable, whereas for TV = 54 almost none 
of them. If this trend continues it is perfectly plausible that in the N ^ oo limit even for L = 10 the adversary 
is never successful. These results, with agreement with conclusions of the previous section, suggest very strong 
pre-asymptotic effects in the AdSAT problem. The strength of the finite size corrections hence poses a challenge to 
numerical verifications of our cavity method asymptotic predictions. 

On the other hand, the scaling argument presented in Eq. (28) suggests that the system sizes at which the 
asymptotic behavior starts to be dominant might be quite large (perhaps thousands or more), this is in particular 
true in the vicinity of the satisfiability threshold. Hence in the AdSAT problem, and likely also in other adversarial 
optimization problems, it is particularly important to develop techniques that predict the pre-asymptotic behavior 
and the finite size corrections. We saw from the results on random regular instances with even degree that BP 
predicts the same Bethe entropy for all balanced negation-configurations independently of the system size, hence 
the methods for analysis of finite size corrections and pre-asymptotic effects will have to go beyond the assumptions 
of the cavity method. On the other hand, analysis of the cases where the BP fixed point is factorized might be a 
good playground for development of such techniques. 

7. Discussion and conclusions 

In this paper we studied the adversarial optimization problem and concluded that the most frustrated instances 
of random X-SAT are very close to the ones with balanced configurations of negations. For random regular 3- 
SAT instances this leads to a threshold L ~ 11, starting from which the adversary is able to find unsatisfiable 
configurations of negations (compare to L = 14 for the ordinary random regular 3-SAT). For the canonical 
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(Poissonian) adversarial 3-SAT this leads to = 3.39(1) (compare to ag = 4.2667 for the ordinary random 
3-SAT). 

This result is rather uninteresting from the algorithmic point of view, as balancing negations is an easy problem. 
However, the same method we used here can be applied to more interesting situations, for instance the quantified 
SAT problem. Recall also that the adversarial satisfiability problem was suggested as a problem interpolation 
between random SAT and random quantum SAT. Note, however, that our study leads to the conclusion that the 
adversary SAT is much closer to the classical random SAT than to the quantum SAT. Note that in the large K 
limit the random satisfiability threshold scales as a w 2^ log 2, the same scaling holds for the threshold in the 
random i^'-SAT with balanced negations, since at large K at this value of a the degree of variables is so large that 
the difference between the Poissonian distribution of the number of non-negated variables and balanced negation- 
configurations does not play any role in the leading order in K. On the other hand, the satisfiability threshold of 
the quantum SAT was upper-bounded by 2^ log 2/2 [14, 15], hence quantum effect must be responsible for this 
drastic decrease of the threshold value. 

We obtained our results by studying the large deviations of the entropy in the ordinary random ii'-SAT. In 
particular, an approach leading to equations very similar to the IRSB equations leads to the calculation of the large 
deviations in the case where rare instances are exponentially rare. Exponentially large deviations are common in 
statistical physics. In some cases, sec e.g. [28, 8, 9, 10], the large deviations are rarer than exponentially rare. In our 
study this arises for instance for the regular random if-SAT with balanced negations and even degree. In cases the 
large deviation function decays faster than exponentially with the system size extremely strong finite size corrections 
and pre-asymptotic effects can be induced as we argued in Section 6, where we presented numerical studies of the 
large deviations and of the satisfiability threshold. Interestingly, methods based on the standard cavity method are 
not straightforwardly applicable to study the related finite size correction and pre-asymptotic behavior. It stays a 
theoretical challenge to find out how to describe analytically and algorithmically these pre-asymptotic effects that 
might be crucial for solving some industrial instances of adversarial optimization problems. 
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